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ABSTRACT 

ParB proteins are one of the three essential compo- 
nents of partition systems that actively segregate 
bacterial chromosomes and plasmids. In binding to 
centromere sequences, ParB assembles as nucleo- 
protein structures called partition complexes. These 
assemblies are the substrates for the partitioning 
process that ensures DNA molecules are segregated 
to both sides of the cell. We recently identified the 
sopC centromere nucleotides required for binding to 
the ParB homologue of plasmid F, SopB. This 
analysis also suggested a role in sopC binding for 
an arginine residue, R219, located outside the 
helix-turn-helix (HTH) DNA-binding motif previously 
shown to be the only determinant for sopC-specific 
binding. Here, we demonstrated that the R219 
residue is critical for SopB binding to sopC during 
partition. Mutating R219 to alanine or lysine abol- 
ished partition by preventing partition complex 
assembly. Thus, specificity of SopB binding relies 
on two distinct motifs, an HTH and an arginine 
residue, which define a split DNA-binding domain 
larger than previously thought. Bioinformatic 
analysis over a broad range of chromosomal ParBs 
generalized our findings with the identification of a 
non-HTH positively charged residue essential for par- 
tition and centromere binding, present in a newly 
identified highly conserved motif. We propose that 
ParB proteins possess two DNA-binding motifs that 
form an extended centromere-binding domain, 
providing high specificity. 

INTRODUCTION 

Low-copy number plasmids and most bacterial chromo- 
somes have acquired partitioning (Par) mechanisms to 



ensure faithful segregation in dividing cells. This stable 
inheritance depends on active partition systems [reviewed 
in (1)]. Virtually, all Par systems are composed of three 
essential elements: a centromere, a centromere-binding 
protein (CBP) and an NTPase. Interactions between 
these partners are important for the partition process, 
and of these, the specific recognition of centromeres by 
CBPs to assemble partition complexes is crucial. 
Partition complexes are substrates for partition machi- 
neries, as they are responsible for the proper intracellular 
localization of the replicon onto which these are 
assembled. CBPs are functional equivalents of eukaryotic 
kinetochores, in which they act as the link between centro- 
meres and NTPases. 

Bacterial partition systems have been classified in three 
types according to the nature of the NTPase involved (2). 
The partition system we study here is of type I, represen- 
tative of the most widespread system, present on most 
plasmids and on all chromosomes carrying a Par system. 
Although, the molecular mechanisms underlying parti- 
tioning of type I systems are still to be unraveled, 
evidence is now accumulating that in vivo type I adenosine 
triphosphatases (ATPases) generally show dynamic be- 
haviors, either oscillating over the nucleoid (3-5) or 
forming metastable filaments (6,7). Most of the ParA 
proteins have also been found to polymerize in vitro in 
response to adenosine triphosphate (ATP) binding 
(7-10). These dynamic behaviors depend on the presence 
of the three Par elements. Notably, ParB stimulates ParA 
activities, such as polymerization (8,9) and ATP hydroly- 
sis (11,12). This latter activity is further stimulated with 
centromere-bound ParB (13). Partition complexes 
assemblies on centromere DNA are thus important for 
the regulation of the partition process. 

Bacterial centromeres, generically called parS sites, are 
diverse in composition and organization. Chromosomal 
parS sites typically consist of 16-bp inverted repeats clus- 
tered around the replication origin or spread throughout 
the Ori domain (14). Plasmid centromeres consist of 
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inverted and/or direct repeats [reviewed in (15)]. The 
repeats can be extensive, for example, sopC of the F 
plasmid consists of 12 tandem repeats of a 43-bp motif 
(16), and parS of pTAR contains 13 heptameric repeats 
that are separated by integral helical turns (17). By 
contrast, the partition site 0 B 3 of plasmid RK2/RP4 is a 
simpler palindromic core sequence of 13 bp, although 
related sites dispersed throughout this plasmid might 
also contribute to segregation (18). Centromeres of more 
complex organization also exist, such as that of plasmid 
PI (19). This ~80-bp parS consists of two arms that 
harbor non-symmetrical heptamer (A-box) and hexamer 
(B-box) DNA motifs, separated by a central binding site 
for the host-encoded DNA-bending protein, integration 
host factor. Despite this high diversity, ParBs generally 
bind to centromere sites through helix-turn-helix (HTH) 
or in some cases ribbon-helix-helix (RHH) motifs [for 
review (15)]. 

In F plasmid, the partitioning locus sop specifies the 
ATPase, SopA, the CBP, SopB and the centromere, 
sopC. Each of the 12 tandem 43-bp repeat contains a 
16-bp inverted repeat that constitutes the SopB-binding 
site (20). SopB is composed of several functional 
domains (Figure 1A). Owing to its high flexibility, the 
structure of the N-terminal part of SopB is unknown, 
which is also the case with others ParBs (21,22). This 
part contains the SopA-interaction domain (23), the 
arginine-finger and the putative arginine-loop motifs that 
stimulate SopA ATPase activity (13) and possibly a SopB 
dimer-dimer interaction region (24). SopB not only forms 
a SopB-sopC partition complex that recruits SopA but 
also extends the complex by spreading along the DNA 
on either side of sopC (25), and also probably in trans to 
different DNA molecules (26). By coating DNA, SopB 
forms extended partition complexes whose role in parti- 
tioning is not understood. In all cases, specific assembly of 
SopB on sopC to nucleate the formation of extended par- 
tition complexes is essential for F partitioning. 

Our present knowledge of the protein-centromere 
contacts that define binding specificity is based on struc- 
tural studies of only three ParB-parS complexes: KorB- 
0 B 3 (RP4), ParB-parS (PI) and SopB-sopC (F). KorB 
binds Ob3 through a classical HTH motif, and the recog- 
nition specificity is provided by two amino acid side chains 
external to the HTH motif (22). The PI partition complex 
is more complicated owing to the unusual organization of 
its centromere site (see earlier in the text). Each HTH 
motif of ParB dimer contacts an A-box on each parS 
arm, and an additional DNA-binding module contacting 
the B-box is provided by the dimerization domain (21). 
The strong specificity of PI ParB-parS interaction is 
shared between these two distinct DNA-binding motifs. 
Specificity determinants of SopB-sopC partition complex 
have been recently investigated at the nucleotide level by 
high-throughput analysis (27). This revealed that all 16 bp 
of sopC are required, to varying degrees, for maximal 
SopB binding strength. In addition, the co-crystal struc- 
ture of a SopB-sopC complex was solved at ~3.5 A (26), 
enabling prediction of the likely contacts between sopC 
and residues of the HTH motif (Figure 1C). Because 
previous results had suggested binding specificity was 



entirely contained within the HTH (23), it was interesting 
that an arginine residue (SopB-R219) outside the HTH 
was close to the sopC guanine at position 7 (sopC-Gl) in 
the co-crystal. 

To better understand the underlying mechanisms 
involved in plasmid partitioning and centromere binding, 
we have now investigated the predicted interaction 
between SopB-R219 and sopC-Gl. Strikingly, we found 
that this interaction is essential for SopB-sopC binding 
in vitro and is necessary for partition in vivo. These 
findings reveal that the specific contacts between SopB 
and sopC occur both inside and outside the HTH motif 
of SopB. Our results refine the basis of the high-binding 
specificity of F partition complex assembly. Based on 
sequence alignments and the known structure of a 
chromosomal ParB, we identified and showed that an 
arginine residue provides a specific contact with the 
chromosomal centromere. Extensive sequence analysis 
revealed that this positively charged residue is part of a 
newly defined and highly conserved motif present in 
chromosomal ParBs over the whole bacterial domain 
of life. 



MATERIALS AND METHODS 

Bacterial strains and plasmids 

Strains are derivatives of Escherichia coli K-12 or E. coli B 
(C2833) and are listed, together with plasmids, in 
Supplementary Tables SI and S2, respectively. 
Construction of plasmids is detailed in Supplementary 
Figure S4. 

Media and growth conditions 

Cultures were essentially grown at 37°C with aeration in 
LB (Miller, 1972) containing thymine (lOugml -1 ) and 
antibiotics (Sigma) as appropriate (ug ml -1 ): kanamycin 
(Km, 50), chloramphenicol (Cm, 10), streptomycin (Sm, 
200) and spectinomycin (Sp, 20 in liquid medium, 30 in 
solid). Luria Broth (LB) was supplemented with 1.5% 
agar (Difco) for solid medium. For stability assay, 
cultures were grown at the indicated temperature with 
aeration in LB or in M9 minimal medium supplemented 
with glucose (0.4%), casamino acids (0.2%), MgS0 4 
(ImM), CaCl 2 (0.1 mM), thiamine (lug ml -1 ), leucine 
(20(.igml _1 ) and thymine (40 ugml -1 ). 

DNA manipulation and related procedures 

Enzyme reactions, DNA preparation, agarose gel 
electrophoresis and transformation with plasmid DNA 
were carried out using standard procedures, according to 
suppliers' recommendations where applicable (restriction 
endonuclease, DNA polymerase and ligase, New England 
BioLabs; polymerase chain reaction with Phusion, 
Fermentas; plasmid and gel-fractionated fragment 
DNAs, Qiagen or Promega). SopB mutations were 
introduced in pYAS6 and pYAS47 plasmids by quick- 
change mutagenesis and verified by DNA sequencing 
(Supplementary Figure S4). 
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Figure 1. Model of SopB-.ro/>C interaction. (A) Schematic representation of SopB. Top; amino acid numbering of SopB to scale. Middle; secondary 
structure of SopB. Helices and arrows represent a-helix and P-sheet, respectively, as determined by crystal analysis in blue or predicted (predator, 
JPRED) in green. Bottom; functional domains and motifs of SopB. Thick cylinders (from amino acids 155-272 and 275-323) represent the parts of 
SopB structure that have been solved, whereas thin cylinders (from amino acids 1-155 and 272-275) have not (26). (B) Representation of SopB-so/;C 
structure drawn from accession number 3EZF (26), with R219 colored in red, R190, K191 and R195 in blue and the guanine 7 in green. Helices 2 
and 3 of the HTH motif are colored in dark blue. (C) Predicted interactions between amino acids of SopB with sopC nucleotides, adapted from the 
study by Pillet et al. (27). R190, R219, R195 and K191 correspond to arginines 190, 195, 219 and lysine 191. Straight and dotted lines represent 
suggested amino acid-nucleotide interactions. R and Y stand for purine and pyrimidine nucleotides, respectively. Green zones represent the subunits 
of an SopB dimer. 



Protein purification 

Intein-tagged SopB-G324, SopB-R219 mutants, ParBp pi „- 
G291 and ParB/. / ,„ r G291-R194A proteins were purified 
from strain C2833 essentially as described previously 
(13), except that fractions were directly collected and 
frozen at — 80° C after elution from the chitin column. 
ParB Bt .„ ; -G306 does not bind to the chitin column. 
ParB Brra -G306 and ParB &£J „-G306-R192A were prepared 
as described earlier in the text, except that Frt was 
incubated directly with dithiothreitol to induce full intein 
cleavage (see Supplementary Figure S3) and was stored in 
aliquots at — 80°C. ParB proteins from Been were assayed 
directly with crude extracts. 

Plasmids stability assay 

Experiments were started from colonies of cells freshly 
transformed with the plasmids under test. SopB alleles, 
expressed from a pSClOl derivative under plac, were 
assayed in strain MCI 061. Overnight cultures in M9- 
glucose-casamino acid medium containing selective 
antibiotics were diluted 400-fold into the same medium 
and grown to A 600 = 0.25. Samples were then diluted 
serially into fresh M9 medium without the antibiotic 
selective for the plasmid under test and were processed 
as described previously (28). To determine the fraction 
of cells that retained the plasmid, samples were taken at 
the beginning and after 20 and 40 generations. The 
calculated loss rate per generation is given by the slope 
A of the equation (y = B.e~ Ax ) from the fitted curves 
(using Graph Pad Prism 4 software) on graph plotting 
the per cent of cell mini-F + as a function of the generation 



time, which provides a good approximation of the loss 
frequency (f) when <10% [f = 1 — (cell mini-F + /total 
cell) 1/n , where n is the generation number]. ParB alleles 
of Been or Pput were assayed for the stabilization of 
pDAG504 in same conditions as previously published 
(29,30). Briefly, for Been alleles, plasmids under test 
were introduced in strain DH10B, and growth cultures 
were performed in M9-glucose-casamino acid medium 
at 37°C. For Pput alleles, plasmids under test were 
introduced into strain DH5a, and growth cultures were 
performed in LB at 37°C. 

Electromobility shift assay 

Experiments were performed as previously described (9), 
using 1 nM radiolabeled DNA probes in the presence of 
sonicated salmon sperm DNA as competitor (100 ugml -1 ). 
Binding reaction products were resolved at 4°C by electro- 
phoresis on 6% polyacrylamide gels in TGE (25 mM Tris 
base, 190mM Glycine, 1 mM Ethylenediaminetetraacetic 
acid) for 2-A h at 180 V. Polyacrylamide gel 
electrophoresis-purified complementary oligonucleotides 
(Sigma-Aldrich) were used to assemble duplex DNA 
probes by annealing (only top strands are shown; 16-bp 
SopB/ParB binding sites are indicated by upper case 
letters; bases corresponding to positions 7 and 10 are 
indicated in bold): sopC2: 5'-attagtcTGGGACCACGGT 
CCCActcgtat-3'; sopC63: 5'-attagtcTGGGACAACTGTC 
CCActcgtat-3'; sopC64: 5'-attagtcTGGGACTACAGTCC 
CActcgtat-3'; parS66: 5'-attagtcTGTTCCACGTGGAAC 
Actcgtat-3'; and parS67: 5'-attagtcTGTTTCACGTGAAA 
CActcgtat. 
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RESULTS 

We have previously investigated the bases involved in the 
specific recognition of the F plasmid centromere by SopB 
[(27); summarized in Figure 1C]. In particular, the guanine 
at position 7 (sopC-Gl) in the left arm (and its counterpart 
at position 10 in the right arm) was found to play a role in 
SopB-sopC interaction. In addition, the co-crystal 
structure of SopB-sopC complexes, determined at a 
medium resolution [~3 A; (26)], suggests that G7 is 
close to an arginine residue (R219), in the major groove 
(Figure IB). We estimated the distance between the O 
group of G7 and the NH group of R219 at 2.6 A 
(Supplementary Figure SI), compatible with hydrogen- 
bonding between them (31). This R219 residue does not 
belong to the HTH motif responsible for the primary 
binding to DNA (Figure 1A), which contains three 
positively charged residues (R190, K191 and R195; 
Figure IB) involved in specificity of binding to sopC 
(13,26). SopB-R219 is part of an a-helix separated from 
the HTH by an other a-helix (Figure 1A). Its potential 
role in centromere binding and thus in the partition 
process was further investigated. 

SopB-R219 mutants are unable to stabilize mini-F 

To determine whether the R219 residue plays a direct role 
in the partition of the mini-F plasmid, we constructed two 
SopB-R219 mutants by converting R219 to alanine 
(R219A) or lysine (R219K). These mutations were 
introduced into a pSClOl derivative, pYAS47, which 
carries sopAB under the control of the lac promoter, 
yielding pJYB161 (sopB-R2\9A) and pJYB162 (sopB- 
R219K). These plasmids were introduced into strain 
MC1061 carrying pDAG209, an unstable mini-F sopC^ 
but AsopAB, and loss of the mini-F during growth 
without selection was measured (Supplementary Figure 
S2A and Table 1). When wild-type SopB was provided 
in trans from pYAS47, no cell without pDAG209 was 
observed after 40 generations, indicating that the mini-F 
sopC+ was completely stable (loss rate < 0.01%). Without 
SopB (pAM238), the mini-F was lost at a frequency of 
4.0% per generation as expected from the copy number 
(~5-6 at division) for segregation at random. The mutant 
alleles SopB-R219A (pJYB161) and -R219K (pJYB162) 
failed to stabilize pDAG209, which was lost at 3.4 and 
3.1% per generation, respectively. Western blot analysis 
showed that in the stability assays, the mutant SopBs were 
produced at the same level as wild-type (wt) 
(Supplementary Figure S2B). These results indicate that 
the R219 residue of SopB is essential for the maintenance 
of mini-F during generation. 

SopB-R219 variants do not bind sopC centromere 

To test whether the inability of sopB-R2l9 alleles in 
stabilizing the mini-F is primarily because of a default in 
centromere-binding activity, we performed electromobility 
shift assays (EMSA). Wild-type SopB and R219 variants 
were expressed as intein-tagged proteins and purified as 
native proteins after cleavage of the intein (see 'Materials 
and Methods' section and Supplementary Figure S3). 



Typical EMSA experiments are shown in Figure 2 A. 
The 32 P-labeled substrate, a 30-bp DNA duplex 
containing the 16-bp consensus SopB-binding site, was 
specifically bound by SopB as previously observed (27). 
By contrast, SopB-R219A showed almost no sopC- 
binding activity: at 1 uM, the amount of protein-DNA 
complex formed is much less than with 10 nM of wild- 
type SopB, indicating that the loss of sopC-binding 
activity in SopB-R219A is >200-fold. The arginine 219 
of SopB is, therefore, essential for specific SopB binding 
to the centromeric site. Similarly, SopB-R219K showed a 
significant reduction in .yo^C-binding activity, although 
slightly less than SopB-R219A. The ~ 100-fold loss in 
sopC binding indicates that the lysine present in this 
variant is not sufficient to compensate for the positive 
charge of the arginine residue in wild-type SopB. 

In addition to its site-specific DNA-binding activity, 
SopB also binds non-specific DNA (9), for spreading 
along the DNA outside of sopC (25). To determine the 
extent to which SopB-R219A variant is deficient in 
DNA-binding activity, we also tested its ability to still 
bind DNA non-specifically. For this, we performed 
EMSA in the absence of competitor DNA, using a 114- 
bp duplex DNA that does not carry the 16-bp sopC site 
(Figure 2B). In these conditions, SopB-wt and SopB- 
R219A bound similarly the non-specific DNA probe. 
For both alleles, one discrete complex (NS1) was 
observed between 100 and 200 nM, and a second discrete 
complex (NS2) was detected between 200 and 500 nM. At 
~500nM SopB, ~50% of the probe was shifted, and 
above this concentration, high molecular weight 
complexes were formed. Thus, both proteins displayed 
about the same affinity for non-specific DNA, indicating 
that SopB-R219 is deficient in sopC binding only. This 
result strengthens the finding that the R219 residue of 
SopB is strongly involved in site-specific DNA binding. 

In vivo SopB-R219A does not form foci 

SopB, like all ParB proteins, forms bright foci in cells 
carrying its cognate centromere (32). These foci 
correspond to the assembly of partition complexes on 
centromeres and are usually located near the cell center 
or the quarter positions (33). To determine whether SopB 
binding to sopC was also affected in vivo by the alanine 
mutation at position 219, we constructed mini-F 
derivatives, pJYB214 and pJYB223, with Venus-Yfp 
fused to SopB-wt and SopB-R219A, respectively. 
pJYB214 was fully stable in strain DLT1215 (data not 
shown), indicating that the fusion is functional. In this 
strain, DLT2950, fluorescence microscopy showed that 
all cells harbor two to four bright regularly positioned 
foci (Figure 3, left panel). By contrast, cells transformed 
with pJYB223, which carries the sopB-R2l9A allele, did 
not form foci, and fluorescence was spread throughout the 
cytoplasm (Figure 3, right panels). In addition, some cells 
(~15%) did not display fluorescence, suggesting that they 
had lost the mini-F, as expected for an unstable mini-F 
derivative. The absence of focus formation implies that 
SopB-R219 does not support centromere binding in vivo. 
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Table 1. Stability of mini-F plasmids carrying F sopC or chromosomal parS centromeres 



F plasmids 


Multicopy plasmids 


Relevant characteristics 


Loss/generation (%) 


pDAG209 


p/\lvlZ JO 




4.U t ± U.J) 


(sopC) 


pYAS47 


sopA + B + 


<0.01 




pJYB161 


sopA + B-R219A 


3.4 (±0.3) 




pJYB162 


sopA + B-R219K 


3.1 (±0.1) 


pDAG551 


pBBRlMCS5 




2.6 (±0.1) 


(par S) 


pDAG562 


Been parA + B + 


0.15 (±0.1) 




pJYB167 


Been parA + B-R192A 


2.7 (±0.4) 


pDAG504 


pAM238 




3.6 (±0.7) 


(parS Pput ) 


pDAG503 


Pput parA + B + 


0.25 (±0.1) 


pJYB168 


Pput parA + B-R194A 


2.1 (±0.5) 



Derivatives of strain MC1061 carrying the mini-F pDAG209 and the indicated plasmids were grown at 30°C. Strains carrying the mini-Fs pDAG551 
(consensus parS) or pDAG504 (parSp pM ) were grown at 37°C. The sop and par operons are under the control of the lac promoter on pAM238 or 
pBBRlMCS5. Samples of the cultures were plated at intervals for estimation of the proportion of mini-F-positive cells. Loss rates per generation, 
given by the slopes (A) of the fitted curves as presented in Supplementary Figure S2A, are average of at least three experiments, and standard 
deviations are indicated in brackets. 



A 

SopB 

B1 



WT 



R219A 



R219K 



sopC2 




R219A 




Figure 2. Interaction of SopB wild-type and R219 variants with DNA. 
The ability of wild-type SopB, SopB-R219A and -R219K to interact 
with an sopC DNA duplex or with non-specific DNA was tested in a 
gel mobility shift assay. Reaction mixtures, assembled on ice and 
incubated at 30°C, were analyzed by electrophoresis on polyacrylamide 
gels. (A) SopBs binding to sopC centromere site. 32 P-labeled 30-bp 
sopCl probe was incubated alone (— ) or with increasing concentrations 
(from left to right; 10, 30, 100, 300 and 1000 nM) of wild-type SopB 
(WT), SopB-R219A or SopB-R219K as indicated on top, in the 
presence of lOOugmL 1 competitor DNA. Positions of DNA probes 
and protein complexes are indicated on the left: Bl denotes the 
specific complex, whereas asterisk corresponds to a specific shift from 
a secondary form of the duplex DNA. (B) SopBs binding to non- 
specific DNA probe. 32 P-labeled 114-bp repE probe was incubated 
alone (— ) or with increasing concentrations (from left to right; 100, 
200, 300, 400, 500, 600, 700, 800, 900, 1000 and 2000 nM) of wild- 
type SopB (WT) or SopB-R219A as indicated on top, in the absence 
of competitor DNA. NS1 and NS2 denote discrete complexes, and 
HMW corresponds to high molecular weight complexes. 




Figure 3. SopB-R129A allele does not form foci cells. Left; localization 
of SopB-Venus fusion. Fluorescent foci from the mini-F expressing 
SopB-Venus display the expected pattern for partition complexes 
faithfully partitioned. Right; localization of SopB-R219A-Venus 
fusion. All cells displayed diffuse fluorescence throughout the 
cytoplasm. Cells without fluorescence are presumed to lack the mF 
carrying SopB-R219A-Venus as expected from the loss rate (see 
Supplementary Figure S2A and main text). 



Guanines 7 and 10 in sopC site are critical for SopB 
binding 

The SopB-sopC co-crystal structure (26) predicts that the 
R219 residue of an SopB monomer is near the guanine in 
position 7 of the left arm of sopC, and thus that R219 of 
the second SopB monomer should be close to the guanine 
10 on the right arm (Figure IB and C). We previously 
observed by SPRi that mutations of these bases affect 
SopB binding to sopC (27). However, because of the low 
sensitivity of this technique, quantification of the loss in 
DNA-binding activity was not possible. Because SopB- 
R219 variants are deficient in sopC binding, we expected 
that if R219 directly interacts with guanines 7 and 10, we 
should observe a comparable loss in SopB binding if these 
bases are mutated. 

We, therefore, performed EMSA in same conditions as 
described earlier in the text with wild-type SopB but with 
30-bp sopC DNA probes carrying either transversion (C 
to A; probe sopC63) or transition (C to T; probe sopC64) 
at positions 7 and 10 (Figure 4). Both probes were barely 
shifted even at the highest SopB concentration (1 uM), 
indicating that these bases are strictly required for the 
formation of SopB-sopC complexes. This result is 
consistent with a direct interaction between R219 
residues and guanine at positions 7 and 10. 
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sopC2 



sopC63 



sopC64 



SopB - 



B1 



sopC 




G7; G10 



T7; T10 



A7: A10 



1 



4 7 10 13 16 
I I I I I I 
5' - tGgGACCRYGGTCcCa - 3' 
3' - aC cCTGGY RCCAGgGt - 5' 

Figure 4. Interaction of wild-type SopB with mutated sopC 
centromeric DNA. Top; the ability of wild-type SopB to interact with 
different sopC DNA duplexes (indicated on top) was assayed as in 
Figure 2. 32 P-labeled 30-bp sopC probes were incubated alone (— ) or 
with increasing concentrations of wild-type SopB (from left to right; 10, 
30, 100, 300 and 1000 nM), except for sopC2 where the 1000 nM SopB 
reaction was omitted. Legend is as in Figure 2. Bottom; bases at 
positions 7 (bottom strand) and 10 (top strand) are indicated in grey. 
Numbers indicate positions of the bases in the palindromic SopB- 
binding site according to the study by Pillet et al. (27). 



Chromosomal CBPs harbors extended DNA-binding 
domains essential for partition activity 

The structure of a chromosomal homologue of SopB, 
SpoOJ of Thermits thermophilus, solved in the absence of 
the parS centromeric DNA, also displays a typical HTH 
proposed to be the /?arS-binding motif (34). Interestingly, 
a helix (H9) located two helices away from the HTH 
recognition helix (H7) was shown to correspond to helix 
H8 of KorB (35), and it also aligns with helix H5 of SopB 
(Figure 6A). H5 Sop b and H8kofb carry the R219 and the 
R240 residues, respectively, involved in specific 
recognition of their cognate centromere [this study, (22)]. 
Helix H9s P ooj also harbors an arginine residue (R179) in 
the corresponding helix (Figure 6A). In the 3D structure 
of the SpoOJ dimer, the orientation of R 179 relative to the 
HTH supports the possibility that it could provide a direct 
interaction with a specific base of parS (Figure 6B). 
Indeed, each arginine residue on both monomers has its 
side chain directed towards where the parS DNA would be 
bound in the HTH motif alignment. 

Sop0J r ,/, e binds the pseudopalindromic parS sequence 
5'-TGTTCCACGTGAAACA-3' . This sequence corres- 
ponds to the highly conserved parS, characteristic of 
centromeres present on bacterial chromosomes, whose 
consensus is 5'-tGTTNCACGTGNAACa-3' . If the 
SpoOJ-R179 residue is involved in specific binding, we 
expect that it should be conserved at least in a subset of 
chromosomal ParBs. To test this hypothesis, we aligned 
ParBs from a large range of bacteria (Figure 6C). We 
found that R179 is highly conserved and present in most 



phyla. In the e-Proteobacteria, the Chlamydiales and the 
Spirochaete groups, the arginine is replaced by a lysine, 
suggesting that the positively charged nature of the residue 
is the important feature, compatible with a direct 
interaction with a base of the parS site. If the H9 helices 
of ParB are functional equivalents to the H5 helix of 
SopB, these R/K residues should also be involved in 
chromosome segregation. 

ParABS partition activity on bacterial chromosome is 
difficult to be assessed directly because of pleiotropic 
effects. However, partition function could be examined by 
the ability of parAB expressed from plac on a multicopy 
vector to stabilize a mini-F (Asop) plasmid carrying a single 
parS in E. coli in trans. Indeed, this assay has been used 
previously to demonstrate the partition capacity of the 
Bacillus subtilis soj-spoOJ-parS and the Pseudomonas 
putida and Burkholderia cenocepacia parABS systems 
(29,30,36). To determine whether these positively charged 
residues are involved in partition, we tested the ParABS^,,,, 
and ParABS Sc£ ,„ systems in stability assays by introducing 
mutations at positions equivalent to SpoOJ-R179, parB- 
R194A and parB-R192A, respectively. We observed, as 
previously reported using the same experimental conditions 
(29,30), that both ParAB systems were able to efficiently 
stabilize their cognate parS carrying mF, pDAG504 or 
pDAG551 (Table 1; loss rates = 0.15-0.25%). We found, 
however, that both parB-R194A and parB-R192A alleles 
were impaired in their ability to support stabilization of 
pDAG504; Been ParB-R192A was completely deficient 
(loss rate = 2.7%), whereas Pput ParB-R194A was highly 
but not completely deficient (loss rate = 2.1%). These 
results thus clearly demonstrate that these positively 
charged residues in H9 helices are essential, as is SopB- 
R219, for the partition function of chromosomal ParBs. 

By performing EMSA, we then tested whether these 
arginine residues are, as for their SopB counterparts, 
directly involved in /wrS-specific binding (Figure5). ParB 
proteins from P. putida were purified as for SopBs, 
whereas ParB proteins from B. cenocepacia were directly 
assayed from crude extracts expressing intein-tagged 
ParBs (see 'Material and Methods' section and 
Supplementary Figure S3). In contrast to wild-type, 
mutant ParBs from Pput and Been were not able to bind 
to their respective parS sites. In both cases, the DNA 
binding deficiency is so high that no specific retardation 
was detected. These results thus indicate that R194 and 
R192 residues from Pput and Been, respectively, are 
directly involved in centromere binding. 

Finally, we found that the R/K residues of H9 helices 
are part of a highly conserved motif, GHA R / K A /vLL, that 
covered most of the H9 helix (Figure 6C). Secondary 
structure analysis on these motifs predicted that they all 
harbor an a-helical conformation (data not shown). We 
named this conserved motif box 'CBM2' for centromere- 
binding motif 2; the CBM1 being the typical HTH motif. 



DISCUSSION 

SopB is known to bind DNA specifically on the sopC 
centromere to function in the active partitioning of the 
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Figure 5. Chromosomal ParBs binding to parS sites. DNA binding 
interaction between Pput or Been parS, left and right panel, 
respectively, and wild-type ParB from Pput (ParB/.,,,,,), Been 
(ParB B „„) or variant ParB-R192A a ,,,„ and ParB-R194A /> „, proteins 
were assayed by EMSA as in Figure 2. 32 P-labeled 30-bp parS probes 
were incubated alone (— ) or with increasing concentrations of purified 
ParBp,,,,, proteins (left panel; from left to right: 0, 10, 30, 100 and 
300 nM) or ParB B „„ proteins in crude extracts (right panel; from left 
to right: 0, 2.8, 8.4, 28 and 84mgmP'). Legend is as in Figure 2. 



F plasmid. We have shown here that the specificity of 
SopB-sopC interaction is more complex than anticipated, 
by identifying a new specific DNA-binding determinant, 
an arginine at position 219, located outside of the HTH 
motif. We demonstrated that SopB-R219A/K variants are 
completely deficient in sopC binding in vitro, whereas both 
mutants are still able to bind DNA non-specifically 
(Figure 2). Fluorescence microscopy revealed that, in 
contrast to wt SopB, neither mutant allele was able to 
form intracellular foci (Figure 4), supporting the idea 
that in vivo they are also deficient in sopC binding. In 
addition, these single mutations totally abolished active 
segregation (Table 1), demonstrating that the R219 
residue is a crucial DNA-binding determinant of the 
SopB-sopC interaction. Extension of this result to 
chromosomal partition systems reveals that most ParBs 
also harbor a secondary CBM that was shown to be 
essential for partitioning functions (Figure 6 and Table 1). 

Direct interaction between R219 and guanine 7 defines an 
extended DNA-binding domain in F SopB 

HTH motifs are composed of two helices that contact the 
DNA in the major groove. The second helix, called the 
recognition helix, usually provides specificity by 
contacting specific bases of the target binding sequence. 
In the case of SopB, the recognition helix makes three 
specific base contacts through positively charged residues 
(K190, K191 and K195) and possibly two others through 
residues S189 and 1192 (26). These numerous contacts 
were thought to be sufficient for providing a high 
specificity for centromere binding. The proximity evident 
in the SopB— sopC structure of the positively charged 
residue, R219, located two helices away from the HTH, 
to guanine at position 7 (G7) was thus intriguing, 
especially because G7 was found be involved in 
SopB binding (27). We found the same strong deficiency 
of binding, as judged by gel retardation, between 



SopB-R219A binding to sopC and wild-type SopB 
binding to sopC carrying either transitions or transver- 
sions at positions 7 and 10 (compare Figures 2A and 4). 
In addition, replacing the arginine by a lysine at position 
219 does not restore binding to sopC (Figure 2A). A 
positive charge at this position is thus not sufficient for 
DNA binding, arguing for specific contact between SopB- 
R219 and sopC-Gl. In agreement with this finding, 
bioinformatic analysis (27) showed that (i) the arginine 
at position 219 is conserved among all closely related 
SopB homologues; and (ii) all sopC centromere sequences 
present in the databases harbor a guanine at position 7. 
Along with these data, our results thus clearly support a 
direct interaction between R219 residues of each SopB 
monomer with each G7 base of the palindromic sopC 
sequence. 

If the R219-G7 interaction was anticipated from the 
structure, the finding that it is a critical one was not. 
Single mutations at position R219 completely abolish 
sopC recognition, revealing that the HTH is not sufficient 
to provide the specificity of the SopB-sopC interaction. 
However, specific contacts made by the recognition helix 
are also crucial for binding specificity, as mutations of two 
of the positively charged residues, K191 and K195, also 
abolish specific, but not non-specific, sopC binding (13). 
The centromere-binding specificity of plasmid F is thus 
shared by two motifs, the HTH and the R219 containing 
helix, which together define an extended DNA-binding 
domain. 

Centromere-binding proteins harbor extended 
DNA-binding domains for highly specific interactions 

Is this extended centromere-binding domain present in 
SopB a common characteristic among CBPs? Some co- 
crystal structures of ParB-parS are available to address 
such a question. Similarly to SopB, KorB of RK2/RP4 
harbors two DNA-binding motifs (22). However, in 
sharp contrast to SopB, residues from the helices 
forming the HTH motif of KorB do not make specific 
contacts with bases in the 0 B site (13-bp parS centromere). 
The KorB-0 B recognition specificity is mainly based on 
two side chain interactions outside the HTH (T211 and 
R240) that form direct hydrogen bonds with two bases 
next to the central base pair binding sequence. Thus, 
although the specificity determinants are not distributed 
as for F SopB, KorB also has an extended centromere- 
binding domain. 

In the case of the PI ParB, the specificity recognition is 
shared by two separate DNA-binding motifs, each 
contacting its specific binding sequence, the A- and 
B-boxes; the PI centromere site being composed of a 
complex arrangement of these two DNA-binding boxes 
(37,38). The A-boxes are specifically bound by the HTH 
motifs, and the B-boxes are bound by the dimerized dimer 
motifs (21). These two motifs participate to the efficient 
binding of ParB to parS; therefore, they could be 
regarded as an extended DNA-binding domain. Although 
PI ParB and F SopB are closely related to each other, SopB 
does not carry this second DNA-binding motif because of 
the lack of two small extra loops residues providing 'wings' 
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Figure 6. ParB conserved motifs and residues involved in centromere binding. (A) Amino acid alignment of ParB homologues. Sequence alignments 
of SopB (F) with KorB (RP4) and SpoOJ (T. thermophilic) are based on the study by Leonard et al. (34). Identical residues and highly conserved 
residues are indicated by asterisks and dots, respectively. Helices HI to H5 of SopB, based on the crystal structure analysis (26), are schematically 
represented on top; Helices H2 and H3, corresponding to the HTH motif, are drawn in yellow. Arginine residues R219 (SopB), R240 (KorB) and 
R179 (SpoOJ) are colored in red. (B) Ribbon diagrams of a ParB dimer. Structure is from T. thermophilus SpoOJ (34). One monomer is colored in cyan 
and the other in green. The right diagram is a 90° horizontal rotation of the left diagram. HTH motifs and R179 residues of each monomer are 
colored in yellow and red, respectively. Each arm of the 16-bp parS DNA is expected to bind one HTH motifs of the SpoOJ dimer. Orientations of 
R179 residues are compatible for providing additional specificity contacts in parS DNA. (C) Amino acid alignment of chromosomal ParBs. ParBs 
from a large spectrum of bacterial species were aligned with ClustalW; in the case of multichromosomal bacteria, ParBs are from the main 
chromosome. These ParBs are expected to bind the parS consensus sequence 5'-tGTTNCACGTGNAACa-3' (30,48). Sequences highlighted in 
yellow and orange correspond to the HTH motif and the CBM2 (centromere-binding motif 2) box, respectively. Helices are represented as in (A) 
based on the crystal structure of SpoOJ of T. thermophilus (34); amino acid sequence are numbered according to Tthe SpoOJ. The red plus sign (+) 
schematically represents the conserved R or K positively charged residues. ParBs are from Borrelia burgdorferi B31(B/w), B. cenocepacia (Been), B. 
subtUis (Bsub), Campylobacter jejeuni (Cjej), Caulobacter crescentus (Cere), Chlamydia trachomatis (Ctra), Helicobacter pylori (Hpyl), Mycobacterium 
tuberculosis (Mtub), Myxococcus xanthus (Mxan), Neisseria meningitidis (Nine), P. putida (Pput), Porphyromonas uenonis (Puen), Rickettsia typhi 
(Rtyp), Staphylococcus aureus (Saur), Streptomyces coelicolor (Scoe), Streptococcus pneumonia (Spne), T. thermophilus (Tthe) and Vibrio cholera (Vcho). 



that are specifically present in the ParB dimerization motif 
(26). This difference is in relation with the centromere 
structures where, in the particular case of PI, ParB 
recognizes two different binding sequences. Thus, as in 



KorB and SopB, PI ParB has an extended DNA- binding 
domain composed of two centromere-binding motifs. 

In the course of this study, we have identified the SopB- 
R219 counterpart present on chromosomal ParBs in most 
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bacterial phyla (Figure 6C). As for SopB, this residue 
present in the two chromosomal Par systems tested here 
is essential for the partition function (Table 1) and is 
absolutely required for binding to its cognate centromere 
sequence (Figure 5). In addition, it belongs to a newly 
identified and well-conserved a-helical motif, 
GHA R / K A / V LL- This motif aligns with helices H5 and 
H8 of SopB and KorB, respectively, which are also 
essential for centromere binding. We, therefore, named 
this new motif CBM2 for centromere-binding motif 2. 
We proposed that chromosomal ParBs also harbor an 
extended centromere-binding domain composed of two 
DNA-binding motifs, the classical HTH (CBM1) and 
the newly identified CBM2. 

We were unable to identify the CBM2 box and the 
positively charged residue on annotated chromosomal 
ParBs in only a few phyla, e.g. in cyanobacteria. The 
CBM2 motif, GHA R / K A /vLL, is also not conserved on 
ParBs from secondary chromosomes in the case of 
bacteria carrying two or three chromosomes (data not 
shown). This could be because of divergent parS sites 
present on these chromosomes. Indeed, in the case of 
secondary chromosomes, parS sequences are different 
from the consensus one present on the main chromosome 
(14,39). It will be of interest to investigate whether a 
second DNA-binding motif is present on these Par 
systems. 

In the case of type lb ParBs that carry RHH DNA- 
binding motifs in place of an HTH, no co-crystal structure 
is available to determine whether specificity determinants 
are also present outside the primary binding motif. 
However, further DNA contacts providing additional 
binding specificity have been suggested by comparison of 
ParG, the ParB analogue of TP228, with the Arc and Mnt 
repressors harboring RHH motifs (40), supporting the 
possibility that type lb ParBs could also carry extended 
centromere-binding domains. 

Initially, the HTH motif of SopB was identified as the 
only determinant for centromere-specific binding (23). 
Our results along with these examples indicate that 
PaiB—parS interactions are not as simple as initially 
proposed. Rather they suggest that bacterial CBPs have 
evolved, using different strategies, to possess extended 
DNA-binding domains. We thus propose that the 
requirement for at least two independent DNA-binding 
motifs is imposed by the need for a high centromere- 
binding specificity. Mutational analysis and co-crystal 
structure determination of other ParB-parS complexes, 
especially from chromosomal Par systems but also 
from ParBs with RHH, will be of particular interest 
to determine the extent of the different strategies 
developed for providing such high specificity for 
centromere binding. 
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